STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 




TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 

Revised 01/10/06 



Raw Sequence Listing Error Summary 



ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1  Wrapped Nucleics / Wrapped Aminos 
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
   was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
   prevent "wrapping." 

2  Invalid Line Length 
   The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3  Misaligned Amino Numbering 
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; 
   use space characters, instead. 

4  Non-ASCII 
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please 
   ensure your subsequent submission is saved in ASCII text. 

5  Variable Length 
   Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules, 
   each n or Xaa can only represent a single residue. Please present the maximum number of each 
   residue having variable length and indicate in the <220>-<223> section that some may be missing. 

6  PatentIn 2.0 "bug" 
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid 
   sequence(s) ____. Normally, PatentIn would automatically generate this section from the 
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
   Artificial or Unknown sequences. 

7  Skipped Sequences (OLD RULES) 
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence: 

   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
   (xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
   This sequence is intentionally skipped 

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

8  Skipped Sequences (NEW RULES) 
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence. 

   <210> sequence id number 
   <400> sequence id number 
   000 

9  Use of n's or Xaa's (NEW RULES) 
   Use of n's and/or Xaa's have been detected in the Sequence Listing. 
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

10 Invalid <213> Response 
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
   is Artificial Sequence. 

11 Use of <220> 
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
   "Unknown." Please explain source of genetic material in <220> to <223> section. 
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

12 PatentIn 2.0 "bug" 
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, 
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 

13 Misuse of n/Xaa 
   "n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid. 
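
The line length, tab code, and non-ASCII checks described in the error summary can be illustrated with a short pre-submission check. The following is a minimal sketch only: it is not the Checker program named earlier in this report and does not reproduce its behavior. The function name check_listing_lines and the wording of the problem messages are assumptions made for this example.

```python
from typing import List, Tuple

# Limit quoted in the "Invalid Line Length" entry (white spaces count).
MAX_LINE_LENGTH = 72


def check_listing_lines(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, problem) pairs for a sequence listing held in memory.

    Hypothetical helper for illustration; it covers only three of the
    conditions named in the error summary.
    """
    problems: List[Tuple[int, str]] = []
    for number, line in enumerate(text.splitlines(), start=1):
        # A line may not exceed 72 characters, spaces included.
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        # Tab codes are not allowed between numbers; spaces are expected.
        if "\t" in line:
            problems.append((number, "tab code found; use space characters"))
        # The file must be plain ASCII text.
        if not line.isascii():
            problems.append((number, "non-ASCII character found"))
    return problems
```

A listing would typically be read from disk and passed in as one string; an empty result means none of these three conditions was found, not that the listing complies with the Sequence Rules as a whole.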
RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/549,352A 

DATE: 11/28/2006 
TIME: 14:28:06 

Input Set : E:\70237USPCT Second Substitute Seq List.txt 
Output Set: N:\CRF4\11282006\J549352A.raw 

3 <110> APPLICANT: DRAKE, Caroline Rachel 
4       PAINE, Jacqueline Ann Mary 
5       SHIPTON, Catherine Ann 
7 <120> TITLE OF INVENTION: Enhanced Accumulation of Carotenoids in Plants 
9 <130> FILE REFERENCE: 70237USPCT 
11 <140> CURRENT APPLICATION NUMBER: US 10/549,352A 
12 <141> CURRENT FILING DATE: 2005-09-14 
14 <150> PRIOR APPLICATION NUMBER: PCT/GB2004/001241 
15 <151> PRIOR FILING DATE: 2004-03-24 
17 <150> PRIOR APPLICATION NUMBER: US60/457,053 
18 <151> PRIOR FILING DATE: 2003-03-22 
20 <160> NUMBER OF SEQ ID NOS: 38 
22 <170> SOFTWARE: PatentIn version 3.2 
24 <210> SEQ ID NO: 1 
25 <211> LENGTH: 5630 
26 <212> TYPE: DNA 
27 <213> ORGANISM 
29 <220> FEATURE: 
30 <223> OTHER INFORMATION 
32 <400> SEQUENCE: 1 
33 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
35 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
37 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
39 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
41 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
43 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
45 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
47 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
49 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
51 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
53 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
55 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
57 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
59 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
61 aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 
63 gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 
65 aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 
67 tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 
69 atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 
71 tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 
73 ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 
75 tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 1320 
77 ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 
79 gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 1440 

81 cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 1500 

83 ctgctgccgg ttacgccgtt ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 

85 gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgtcgaa 1620 

87 ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 

89 ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 

91 ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctg 1800 

93 cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 

95 atttatacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 1920 

97 accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 

99 ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 

101 gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 

103 tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 

105 aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 

107 ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 

109 aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 2340 

111 tcgtcactgg cgcctgaagg ttgcggcagt tactatgtgt tggcgccggt gccgcattta 2400 

113 ggcaccgcga acctcgactg gacggttgag gggccaaaac tacgcgaccg tatttttgcg 2460 

115 taccttgagc agcattacat gcctggctta cggagtcagc tggtcacgca ccggatgttt 2520 

117 acgccgtttg attttcgcga ccagcttaat gcctatcatg gctcagcctt ttctgtggag 2580 

119 cccgttctta cccagagcgc ctggtttcgg ccgcataacc gcgataaaac cattactaat 2640 

121 ctctacctgg tcggcgcagg cacgcatccc ggcgcaggca ttcctggcgt catcggctcg 2700 

123 gcaaaagcga cagcaggttt gatgctggag gatctgattt gaggccatgc aggccgatcc 2760 

125 ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 2820 

127 cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 2880 

129 gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 2940 

131 acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc gcggtgtcat 3000 

133 ctatgttact agatcgggcc ttaataagct tgttaatcat ggtgtaggca acccaaataa 3060 

135 aacaccaaaa tatgcacaag gcagtttgtt gtattctgta gtacagacaa aactaaaagt 3120 

137 aatgaaagaa gatgtggtgt tagaaaagga aacaatatca tgagtaatgt gtgagcatta 3180 

139 tgggaccacg aaataaaaag aacattttga tgagtcgtgt atcctcgatg agcctcaaaa 3240 

141 gttctctcac cccggataag aaacccttaa gcaatgtgca aagtttgcat tctccactga 3300 

143 cataatgcaa aataagatat catcgatgac atagcaactc atgcatcata tcatgcctct 3360 

145 ctcaacctat tcattcctac tcatctacat aagtatcttc agctaaatgt tagaacataa 3420 

147 acccataagt cacgtttgat gagtattagg cgtgacacat gacaaatcac agactcaagc 3480 

149 aagataaagc aaaatgatgt gtacataaaa ctccagagct atatgtcata ttgcaaaaag 3540 

151 aggagagctt ataagacaag gcatgactca caaaaattca tttgcctttc gtgtcaaaaa 3600 

153 gaggagggct ttacattatc catgtcatat tgcaaaagaa agagagaaag aacaacacaa 3660 

155 tgctgcgtca attatacata tctgtatgtc catcattatt catccacctt tcgtgtacca 3720 

157 cacttcatat atcatgagtc acttcatgtc tggacattaa caaactctat cttaacattt 3780 

159 agatgcaaga gcctttatct cactataaat gcacgatgat ttctcattgt ttctcacaaa 3840 

161 aagcattcag ttcattagtc ctacaacaac gaattcggct tcccgggtac agggtaaatt 3900 

163 tctagttttt ctccttcatt ttcttggtta ggaccctttt ctctttttat ttttttgagc 3960 

165 tttgatcttt ctttaaactg atctattttt taattgattg gttatcgtgt aaatattaca 4020 

167 tagctttaac tgataatctg attactttat ttcgtgtgtc tttgatcatc ttgatagtta 4080 

169 cagaaccgtc gactctagag aagccattta aatcgccgcc accatggcca tcatactcgt 4140 

171 acgagcagcg tcgccggggc tctccgccgc cgacagcatc agccaccagg ggactctcca 4200 

173 gtgctccacc ctgctcaaga cgaagaggcc ggcggcgcgg cggtggatgc cctgctcgct 4260 

175 ccttggcctc cacccgtggg aggctggccg tccctccccc gccgtctact ccagcctgcc 4320 
177 cgtcaacccg gcgggagagg ccgtcgtctc gtccgagcag aaggtctacg acgtcgtgct 4380 
179 caagcaggcc gcattgctca aacgccagct gcgcacgccg gtcctcgacg ccaggcccca 4440 
181 ggacatggac atgccacgca acgggctcaa ggaagcctac gaccgctgcg gcgagatctg 4500 
183 tgaggagtat gccaagacgt tttacctcgg aactatgttg atgacagagg agcggcgccg 4560 
185 cgccatatgg gccatctatg tgtggtgtag gaggacagat gagcttgtag atgggccaaa 4620 
187 cgccaactac attacaccaa cagctttgga ccggtgggag aagagacttg aggatctgtt 4680 
189 cacgggacgt ccttacgaca tgcttgatgc cgctctctct gataccatct caaggttccc 4740 
191 catagacatt cagccattca gggacatgat tgaagggatg aggagtgatc ttaggaagac 4800 
193 aaggtataac aacttcgacg agctctacat gtactgctac tatgttgctg gaactgtcgg 4860 
195 gttaatgagc gtacctgtga tgggcatcgc aaccgagtct aaagcaacaa ctgaaagcgt 4920 
197 atacagtgct gccttggctc tgggaattgc gaaccaactc acgaacatac tccgggatgt 4980 
199 tggagaggat gctagaagag gaaggatata tttaccacaa gatgagcttg cacaggcagg 5040 
201 gctctctgat gaggacatct tcaaaggggt cgtcacgaac cggtggagaa acttcatgaa 5100 
203 gaggcagatc aagagggcca ggatgttttt tgaggaggca gagagagggg taactgagct 5160 
205 ctcacaggct agcagatggc cagtatgggc ttccctgttg ttgtacaggc agatcctgga 5220 
207 tgagatcgaa gccaacgact acaacaactt cacgaagagg gcgtatgttg gtaaagggaa 5280 
209 gaagttgcta gcacttcctg tggcatatgg aaaatcgcta ctgctcccat gttcattgag 5340 
211 aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 
213 agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 
215 aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 5520 
217 tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 
219 gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5630 
222 <210> SEQ ID NO: 2 
223 <211> LENGTH: 5630 
224 <212> TYPE: DNA 
225 <213> ORGANISM: Artificial Sequence 
227 <220> FEATURE: 
228 <223> OTHER INFORMATION: 12421 
230 <400> SEQUENCE: 2 
231 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
233 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
235 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
237 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
239 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
241 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
243 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
245 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
247 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
249 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
251 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
253 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
255 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
257 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
259 aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 
261 gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 
263 aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 
265 tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 
267 atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 
269 tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 
271 ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 
273 tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 1320 
275 ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 
277 gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 1440 
279 cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 1500 
281 ctgctgccgg ttacgccgtt ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 
283 gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgtcgaa 1620 
285 ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 
287 ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 
289 ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctg 1800 
291 cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 
293 atttatacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 1920 
295 accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 
297 ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 
299 gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 
301 tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 
303 aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 
305 ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 
307 aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 2340 
309 tcgtcactgg cgcctgaagg ttgcggcagt tactatgtgt tggcgccggt gccgcattta 2400 
311 ggcaccgcga acctcgactg gacggttgag gggccaaaac tacgcgaccg tatttttgcg 2460 
313 taccttgagc agcattacat gcctggctta cggagtcagc tggtcacgca ccggatgttt 2520 
315 acgccgtttg attttcgcga ccagcttaat gcctatcatg gctcagcctt ttctgtggag 2580 
317 cccgttctta cccagagcgc ctggtttcgg ccgcataacc gcgataaaac cattactaat 2640 
319 ctctacctgg tcggcgcagg cacgcatccc ggcgcaggca ttcctggcgt catcggctcg 2700 
321 gcaaaagcga cagcaggttt gatgctggag gatctgattt gaggccatgc aggccgatcc 2760 
323 ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 2820 
325 cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 2880 
327 gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 2940 
329 acgcgataga aaacaaaata tagcgcgcaa actaggataa attatcgcgc gcggtgtcat 3000 
331 ctatgttact agatcgggcc ttaataagct tgttaatcat ggtgtaggca acccaaataa 3060 
333 aacaccaaaa tatgcacaag gcagtttgtt gtattctgta gtacagacaa aactaaaagt 3120 
335 aatgaaagaa gatgtggtgt tagaaaagga aacaatatca tgagtaatgt gtgagcatta 3180 
337 tgggaccacg aaataaaaag aacattttga tgagtcgtgt atcctcgatg agcctcaaaa 3240 
339 gttctctcac cccggataag aaacccttaa gcaatgtgca aagtttgcat tctccactga 3300 
341 cataatgcaa aataagatat catcgatgac atagcaactc atgcatcata tcatgcctct 3360 
343 ctcaacctat tcattcctac tcatctacat aagtatcttc agctaaatgt tagaacataa 3420 
345 acccataagt cacgtttgat gagtattagg cgtgacacat gacaaatcac agactcaagc 3480 
347 aagataaagc aaaatgatgt gtacataaaa ctccagagct atatgtcata ttgcaaaaag 3540 
349 aggagagctt ataagacaag gcatgactca caaaaattca tttgcctttc gtgtcaaaaa 3600 
351 gaggagggct ttacattatc catgtcatat tgcaaaagaa agagagaaag aacaacacaa 3660 
353 tgctgcgtca attatacata tctgtatgtc catcattatt catccacctt tcgtgtacca 3720 
355 cacttcatat atcatgagtc acttcatgtc tggacattaa caaactctat cttaacattt 3780 
357 agatgcaaga gcctttatct cactataaat gcacgatgat ttctcattgt ttctcacaaa 3840 
359 aagcattcag ttcattagtc ctacaacaac gaattcggct tcccgggtac agggtaaatt 3900 
361 tctagttttt ctccttcatt ttcttggtta ggaccctttt ctctttttat ttttttgagc 3960 
363 tttgatcttt ctttaaactg atctattttt taattgattg gttatcgtgt aaatattaca 4020 
365 tagctttaac tgataatctg attactttat ttcgtgtgtc tttgatcatc ttgatagtta 4080 
367 cagaaccgtc gactctagag aagccattta aatcgccgcc accatggcca tcatactcgt 4140 
369 acgagcagcg tcgccggggc tctccgccgc cgacagcatc agccaccagg ggactctcca 4200 
371 gtgctccacc ctgctcaaga cgaagaggcc ggcggcgcgc cggtggatgc cctgctcgct 4260 
373 ccttggcctc cacccgtggg aggctggccg tccctccccc gccgtctact ccagcctcgc 4320 
375 cgtcaacccg gcgggagagg ccgtcgtctc gtccgagcag aaggtctacg acgtcgtgct 4380 
377 caagcaggcc gcattgctca aacgccagct gcgcacgccg gtcctcgacg ccaggcccca 4440 
379 ggacatggac atgccacgca acgggctcaa ggaagcctac gaccgctgcg gcgagatctg 4500 
381 tgaggagtat gccaagacgt tttacctcgg aactatgttg atgacagagg agcggcgccg 4560 
383 cgccatatgg gccatctatg tgtggtgtag gaggacagat gagcttgtag atgggccaaa 4620 
385 cgccaactac attacaccaa cagctttgga ccggtgggag aagagacttg aggatctgtt 4680 
387 cacgggacgt ccttacgaca tgcttgatgc cgctctctct gataccatct caaggttccc 4740 
389 catagacatt cagccattca gggacatgat tgaagggatg aggagtgatc ttaggaagac 4800 
391 aaggtataac aacttcgacg agctctacat gtactgctac tatgttgctg gaactgtcgg 4860 
393 gttaatgagc gtaccagtga tgggcatcgc atccgagtct aaagcaacaa ctgaaagcgt 4920 
395 gtacagtgct gccttggctc tcggaattgc gaaccaactc acgaacatac tccgggatgt 4980 
397 tggagaggat gctagacgag gaaggatata tttaccacaa gatgagcttg cacaggcagg 5040 
399 gctctctgat gaggacatct tcaaaggggt cgtcacgaac cggtggagaa acttcatgaa 5100 
401 gaggcagatc aagagggcca ggatgttttt tgaggaggca gagagagggg taactgagct 5160 
403 ctcacaggct agcagatggc cagtatgggc ttccctgttg ttgtacaggc agatcctgga 5220 
405 tgagatcgaa gccaacgact acaacaactt cacgaagagg gcgtatgttg gtaaagggaa 5280 
407 gaagttgcta gcacttcctg tggcatatgg aaaatcgcta ctgctcccat gttcattgag 5340 
409 aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 
411 agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 
413 aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 5520 
415 tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 
417 gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5630 
420 <210> SEQ ID NO: 3 
421 <211> LENGTH: 5180 
422 <212> TYPE: DNA 
423 <213> ORGANISM: Artificial Sequence 
425 <220> FEATURE: 
426 <223> OTHER INFORMATION: 12422 
428 <400> SEQUENCE: 3 
429 gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 
431 tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 
433 acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
435 gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
437 caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 
439 tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
441 agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 
443 gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 
445 tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 
447 aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 
449 gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
451 atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
453 ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
455 cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
457 aattcggctt cccaaatcgc cgccaccatg gcttctatga tatcctcttc cgctgtgaca 900 
459 acagtcagcc gtgcctctag ggggcaatcc gccgcagtgg ctccattcgg cggcctcaaa 960 
461 tccatgactg gattcccagt gaagaaggtc aacactgaca ttacttccat tacaagcaat 1020 